Finding High-Frequent Synonyms of A Domain-Specific Verb in English Sub-Language of MEDLINE Abstracts Using WordNet

نویسندگان

  • Chun Xiao
  • Dietmar Rösner
چکیده

The task of binary relation extraction in IE [3] is based mainly on high-frequent verbs and patterns. During the extraction of a specific relation from MEDLINE English abstracts, it is noticed that besides the high-frequent verb itself which represents the specific relation, some other word forms, such as the nominal and adjective forms of this verb, as well as its synonyms, also play a very important role. Because of the characteristics of the sub-language in MEDLINE abstracts, the synonym information of the verb can not be obtained directly from a lexicon such as WordNet [1]. In this paper, an approach which makes use of both corpus information and WordNet synonym set (WN-synset) information is proposed to find out the synonyms of a domain-specific verb in a sub-language. Given a golden standard synonym list obtained from the test corpus, the recall of this approach achieved 60% under the condition that the precision is 100%. The verbs corresponding to the 60% recall cover 93.05% of all occurrences of verbs in the golden standard synonym list.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Construction of Persian ICT WordNet using Princeton WordNet

WordNet is a large lexical database of English language, in which, nouns, verbs, adjectives, and adverbs are grouped into sets of cognitive synonyms (synsets). Each synset expresses a distinct concept. Synsets are interlinked by both semantic and lexical relations. WordNet is essentially used for word sense disambiguation, information retrieval, and text translation. In this paper, we propose s...

متن کامل

Detecting Multiword Verbs in the English Sublanguage of MEDLINE Abstracts

In this paper, we investigate the multiword verbs in the English sublanguage of MEDLINE abstracts. Based on the integration of the domain-specific named entity knowledge and syntactic as well as statistical information, this work mainly focuses on how to evaluate a proper multiword verb candidate. Our results present a sound balance between the lowand high-frequency multiword verb candidates in...

متن کامل

Task-Specific Artifacts of Parametric Properties in English as a Second Language Acquired by Persian-Speaking Learners

This experimental study investigated the learners’ integrative acquisition of obligatory overt subjects and subject-verb clause agreement in English as an L2. In L1 acquisition research, correlations between superficially unrelated linguistic phenomena are analyzed in terms of integrative effects. For instance, in English L1 acquisition, there is evidence for an integrative appearance of subjec...

متن کامل

Functional analysis of Subject and Verb in Theses Abstracts on Applied Linguistics

The purpose of the present study is to analyse abstracts related to Applied Linguistics, and more precisely the discourse functions of grammatical subjects and verbs. The corpus consisted of 50 PhD thesis abstracts written on the subject of Applied Linguistics. All of the abstracts were written from 2010 to 2014. The theses from which the abstracts were extracted are available in the ProQuest d...

متن کامل

Functional analysis of Subject and Verb in Theses Abstracts on Applied Linguistics

The purpose of the present study is to analyse abstracts related to Applied Linguistics, and more precisely the discourse functions of grammatical subjects and verbs. The corpus consisted of 50 PhD thesis abstracts written on the subject of Applied Linguistics. All of the abstracts were written from 2010 to 2014. The theses from which the abstracts were extracted are available in the ProQuest d...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004